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The following is a quotation of the appropriate paragraphs of 35 U.S. C. 102 that form the 
basis for the rejections under this section made in this Office action: 
A person shall be entitled to a patent unless - 

(e) the invention was described in (1) an application for patent, published under section 122(b), by another filed 
in the United States before the invention by the applicant for patent or (2) a patent granted on an application for 
patent by another filed in the United States before the invention by the applicant for patent, except that an 
international application filed under the treaty defined in section 35 1(a) shall have the effects for purposes of this 
subsection of an application filed in the United States only if the international application designated the United 
States and was published under Article 21(2) of such treaty in the English language. 

Claims 1-9 and 11-15 are rejected under 35 U.S.C. 102(e) as being anticipated by Taira 
(U.S. Pre-Grant Publication US 2003/0105638. 

Claim 1: Paragraph 0084 outlines the step of gathering a large number of medical 
reports for the purpose of establishing a set of training data (paragraph 0086, lines 1-4). The 
medical reports constitute the corpus of text and includes both abbreviations and expansions of 
the abbreviation. An example is given in paragraph 0088, where the abbreviation is "5cm" and 
the expansion of the abbreviation is the word "large" in paragraph 0090. 

The expansion is a text processing step identified in paragraphs 0089-0092, where one 
logical relation is created having the expansion word "large" as one of the three optional possible 
logical relations for the correct meaning of "5cm". 

Context information is created as a text processing step by generating a context vector 
"b" (see FIG. 10A and paragraph 0098), which is used as part of a larger probability formula to 
determine the probability that the logical relation is true in a specific context. 

The result is a set of mathematical probabilities which are the training data that the 
knowledge base uses to correctly determine whether the expansion is related to the phrase 
"5cm". 
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Claim 2: The feature vector "b" (FIG. 10A and paragraph 0098) is readable as local 
context information because it based on information from the immediately surrounding sentence 
of a particular word (paragraph 0097, lines 5-6). 

Claim 3: The local context information is the context vector b and is based upon the 
context within a given sentence (paragraph 0097, lines 5-6). The training data, which is the 
resulting probability calculation using the context vector b, would also be dependent upon that 
sentence. 

Claim 4: Sentences inherently include words, so the sentence level information used to 
establish the context vector "b" word inherently end up being based on specific words. 

Claim 5: The context information is the context vector "b" (FIG. 10A and paragraph 
0098, lines 3-6). The context vector is defined in terms of feature functions "f(a, b)", with these 
feature functions being readable as discourse context information. Both the context vector "b" 
and feature functions are stored in order to permit the probability calculation shown in paragraph 
0096. 

Claim 6: The "discourse context information" are the feature functions "f(a,b)". The 
"discourse training data" is the resulting probability calculation using these function in paragraph 
0096. All of these functions are calculations are based on the characteristics of sample text 
provided to generate the training data (paragraph 0084). 

Claim 7: The local context information is the context vector "b", while the discourse 
context information are the feature functions "f(a,b)" The underlying numerical values in the 
respective vector and functions are the respective "local context data" and "discourse data", 
lacking any further clarity in the ciaim as to exactly what these entities are. 
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Claim 8: The vector "b" described in paragraph 0096 through line 6 of paragraph 0098 is 
readable as a feature vector and defines the context information. 

Claim 9: The probability calculation shown in paragraph 0096 is a maximum entropy 
model (as described in FIG. 10b) and utilizes feature vectors "b". It determines the probability 
that an expanded version of abbreviated text, such as the word "Large" can be accurately related 
to an abbreviated phrase, such as "5 cm" (see paragraphs 0088 through 0092). 

Claim 11: The text is processed using a statistical model (probability model of 
paragraph 0096 and FIG. 10b which illustrates the same model). 

Claim 12: Paragraph 0088 gives an example of an abbreviation in the training text, such 
as the phrase "5cm". Paragraph 0090 gives an example of an expansion of the abbreviation, such 
as the word "Large". The training text (paragraph 0084) is processed to produce and identify the 
expansion of the abbreviation. 

Claim 13: Paragraph 0084 outlines the collection of a corpus of documents used to create 
a knowledge base, which correlates to a database. The database includes both abbreviation data 
(phrases such as "5cm") and expanded versions which are logically related to the abbreviations 
(words such as "Large" — see paragraphs 0098 and 0090). As described in paragraphs 0089- 
0092, logical relations are generated (three are shown in paragraphs 0090-0092) that identify at 
least one expansion (the word "Large" in paragraph 0090 is an expansion). For each of the three 
logical relations, a feature vector "b" is generated which includes context data (paragraph 0098, 
lines 3-5). The vectors are generated and stored so as to permit the calculation of probability that 
one of the three logical relations most closely correlates to the abbreviation when used in a 
specific context. 
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Claim 14: See remarks for claims 2 and 5. 
Claim 15: See remarks for claim 11. 

Claim 10 is objected to as being dependent upon a rejected base claim, but would be 
allowable if rewritten in independent form including all of the limitations of the base claim and 
any intervening claims. 

Remarks 

Applicant's arguments and amendments have been considered. 

Applicant first argues that the Taira reference does not teach the general concept of 
abbreviation normalization. Examiner has fully considered this assertion but does not agree. 
Paragraphs 0088 through 0092 teach the concept of developing logical expansions for 
abbreviations in medical text. In this particular example, logical relations are developed for sizes 
that may be used in the text, such as the size "5 cm". One such logical relation is the word "large" 
as an expansion of the specified size of "5 cm". Examiner maintains that the step of developing 
these logical relations meets the claimed requirements for processing the corpus of text (i.e. the 
source text) to identify expansions ("large" being an expansion of the abbreviation "5cm"). 

Applicant also argues that claim 1 has been amended to define an automated processing 
of text. However, claim 1 has only been amended to define the "processing" feature and does not 
require that nay of the steps be performed by an automated system. Accordingly, even the 
manual process described in Taira at paragraphs 0088 through 0092 meets the currently defined 
claim limitations. 

THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time 
policy as set forth in 37 CFR 1 . 136(a). 



Application/Control Number: 1 0/6 1 3,26 1 Page 6 

Art Unit: 2165 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within TWO 
MONTHS of the mailing date of this final action and the advisory action is not mailed until after 
the end of the THREE-MONTH shortened statutory period, then the shortened statutory period 
will expire on the date the advisory action is mailed, and any extension fee pursuant to 37 
CFR 1.136(a) will be calculated from the mailing date of the advisory action. In no event, 
however, will the statutory period for reply expire later than SIX MONTHS from the mailing 
date of this final action. 

Any inquiry concerning this communication should be directed to Sam Rimell at 
telephone number (571) 272-4084. 

Sam Rimell 
Primary Examiner 
Art Unit 2165 




